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Abstract 

We study some methods of subgradient projections for solving a 
convex feasibility problem with general (not necessarily hyperplanes 
or half-spaces) convex sets in the inconsistent case and propose a strat- 
egy that controls the relaxation parameters in a specific self-adapting 



manner. This strategy leaves enough user-flexibihty but gives a math- 
ematical guarantee for the algorithm's behavior in the inconsistent 
case. We present numerical results of computational experiments that 
illustrate the computational advantage of the new method. 

1 Introduction 

In this paper we consider, in an Euclidean space framework, the method of 
simultaneous subgradient projections for solving a convex feasibility prob- 
lem with general (not necessarily linear) convex sets in the consistent and 
inconsistent cases. To cope with this situation, we propose two algorithmic 
developments. One uses steering parameters instead of relaxation parame- 
ters in the simultaneous subgradient projection method, and the other is a 
strategy that controls the relaxation parameters in a specific self-adapting 
manner that leaves enough user-flexibility while yielding some mathemati- 
cal guarantees for the algorithm's behavior in the inconsistent case. For the 
algorithm that uses steering parameters there is currently no mathematical 
theory. We present numerical results of computational experiments that show 
the computational advantage of the mathematically-founded algorithm im- 
plementing our specific relaxation strategy. In the remainder of this section 
we elaborate upon the meaning of the above-made statements. 

Given m closed convex subsets Qi,Q2,- ■ ■ , Qm ^ -R" of the n-dimensional 
Euclidean space, expressed as 

g, = {xGi?"|/,(x)<0}, (1.1) 

where /« : i?"" — ;> i? is a convex function, the convex feasibility problem (CFP) 



IS 



find a point x* e Q := n^^g^. (1.2) 



As is well-known, if the sets are given in any other form then they can be rep- 
resented in the form (II. ip by choosing for /, the squared Euclidean distance 
to the set. Thus, it is required to solve the system of convex inequalities 

fi{x)<0, t = l,2,...,m. (1.3) 

A fundamental question is how to approach the CFP in the inconsistent 
case, when Q = fl™ ^g^ = 0. Logically, algorithms designed to solve the CFP 
by finding a point x* & Q are bound to fail and should, therefore, not be 



employed. But this is not always the case. Projection methods that are 
commonly used for the CFP, particularly in some very large real-world ap- 
plications (see details below) are applied to CFPs without prior knowledge 
whether or not the problem is consistent. In such circumstances it is imper- 
ative to know how would a method, that is originally known to converge for 
a consistent CFP, behave if consistency is not guaranteed. 

We address this question for a particular type of projection methods. In 
general, sequential projection methods exhibit cyclic convergence in the in- 
consistent case. This means that the whole sequence of iterates does not con- 
verge, but it breaks up into m convergent subsequences (see Gubin, Polyak 
and Raik [3H Theorem 2] and Bauschke, Borwein and Lewis [S]). In contrast, 
simultaneous projection methods generally converge, even in the inconsistent 
case, to a minimizer of a proximity function that "measures" the weighted 
sum of squared distances to all sets of the CFP, provided such a minimizer 
exists (see lusem and De Pierro [37] for a local convergence proof and Com- 
bettes [26] for a global one). 

Therefore, there is an advantage in using simultaneous projection meth- 
ods from the point of view of convergence. Additional advantages are that 
(i) they are inherently parallel already at the mathematical formulation level 
due to the simultaneous nature, and (ii) they allow the user to assign weights 
(of importance) to the sets of the CFP. However, a severe limitation, common 
to sequential as well as simultaneous projection methods, is the need to solve 
an inner-loop distance-minimization step for the calculation of the orthogo- 
nal projection onto each individual set of the CFP. This need is alleviated 
only for convex sets that are simple to project onto, such as hyperplanes or 
half-spaces. 

A useful path to circumvent this limitation is to use subgradient pro- 
jections that rely on calculation of subgradients at the current (available) 
iteration points, see Censor and Lent [20] or [211 Section 5.3]. lusem and 
Moledo [H] studied the simultaneous projection method with subgradient 
projections but only for consistent CFPs. To the best of our knowledge, 
there does not exist a study of the simultaneous projection method with 
subgradient projections for the inconsistent case. Our present results are a 
contribution towards this goal. 

The CFP is a fundamental problem in many areas of mathematics and 
the physical sciences, see, e.g., Combettes [25], [27] and references therein. It 
has been used to model significant real-world problems in image reconstruc- 
tion from projections, see, e.g., Herman [32], in radiation therapy treatment 



planning, see Censor, Altschuler and Powlis [TTj and Censor [15], and in 
crystallography, see Marks, Sinkler and Landree [12], to name but a few, 
and has been used under additional names such as set-theoretic estimation 
or the feasible set approach. A common approach to such problems is to 
use projection algorithms, see, e.g., Bauschke and Borwein ^, which employ 
orthogonal projections (i.e., nearest point mappings) onto the individual sets 
Qi. The orthogonal projection Pn{z) of a point z G i?" onto a closed convex 
set f] C i?" is defined by 

Pq{z) '■= argminjll z — x || | x G Q}, (1-4) 

where, throughout this paper, || ■ || and (■, ■) denote the Euclidean norm 
and inner product, respectively, in i?". Frequently a relaxation parameter is 
introduced so that 

Pn,xiz):=il-\)z + \Pn{z) (1.5) 

is the relaxed projection of z onto Q with relaxation A. Many iterative pro- 
jection algorithms for the CFP were developed, see Subsection 11.11 below. 

1.1 Projection methods: Advantages and earlier work 

The reason why the CFP is looked at from the viewpoint of projection meth- 
ods can be appreciated by the following brief comments, that we made in 
earlier publications, regarding projection methods in general. Projections 
onto sets are used in a variety of methods in optimization theory but not 
every method that uses projections really belongs to the class of projection 
methods. Projection methods are iterative algorithms which use projections 
onto sets. They rely on the general principle that projections onto the given 
individual sets are easier to perform then projections onto other sets derived 
from the given individual sets (intersections, image sets under some trans- 
formation, etc.) 

A projection algorithm reaches its goal, related to the whole family of sets, 
by performing projections onto the individual sets. Projection algorithms 
employ projections onto convex sets in various ways. They may use different 
kinds of projections and, sometimes, even use different types of projections 
within the same algorithm. They serve to solve a variety of problems which 
are either of the feasibility or the optimization types. They have different 
algorithmic structures, of which some are particularly suitable for parallel 



computing, and they demonstrate nice convergence properties and/or good 
initial behavior patterns. 

Apart from theoretical interest, the main advantage of projection meth- 
ods, which makes them successful in real-world applications, is computa- 
tional. They commonly have the ability to handle huge-size problems that 
are beyond the ability of more sophisticated, currently available, methods. 
This is so because the building blocks of a projection algorithm are the pro- 
jections onto the given individual sets (which are easy to perform) and the 
algorithmic structure is either sequential or simultaneous (or in-between). 

The field of projection methods is vast and we mention here only a few 
recent works that can give the reader some good starting points. Such a list 
includes, among many others, the works of Crombez [23130], the connection 
with variational inequalities, see, e.g., Aslam-Noor [13], Yamada [H] which 
is motivated by real-world problems of signal processing, and the many con- 
tributions of Bauschke and Combettes, see, e.g., Bauschke, Combettes and 
Kruk [6] and references therein. Bauschke and Borwein |@] and Censor and 
Zenios fU\ Chapter 5] provide reviews of the field. 

Systems of linear equations, linear inequalities, or convex inequalities are 
all encompassed by the CFP which has broad applicability in many areas 
of mathematics and the physical and engineering sciences. These include, 
among others, optimization theory (see, e.g., Eremin [33], Censor and Lent 
[2U] and Chinneck [22]), approximation theory (see, e.g., Deutsch [HI] and 
references therein), image reconstruction from projections in computerized 
tomography (see, e.g., Herman [351 136]) and control theory (see, e.g., Boyd 
et al. [8].) 

Combettes [21] and Kiwiel [30] have studied the subgradient projection 
method for consistent CFPs. Their work presents more general algorithmic 
steps and is formulated in Hilbert space. Some work has already been done on 
detecting infeasibility with certain subgradient projection methods by Kiwiel 
[38l [39j. However, our approach differs from the latter in that it aims at a 
subgradient projection method that "will work" regardless of the feasibility of 
the underlying CFP and which does not require the user to study in advance 
whether or not the CFP is consistent. Further questions arise such as that of 
combining our work, or the above quoted results, with Pierra's [H] product 
space formalism, as extended to handle inconsistent situations by Combettes 
|26j . These questions are currently under investigation. 



2 Simultaneous subgradient projections with 
steering parameters 

Subgradient projections have been incorporated in iterative algorithms for 
the solution of CFPs. The cyclic subgradient projections (CSP) method for 
the CFP was given by Censor and Lent [20] as follows. 



Algorithm 2.1 The method of cyclic subgradient projections (CSP). 
Initialization: x^ G R"' is arbitrary. 
Iterative step: Given x^, calculate the next iterate x'^'^^ by 

^k+i ^\x^- aj-^^t\ if /,(,)(x'=) > 0, ^2.1) 

1 x\ if /.(fc)(x^)<0, 

where t^ G dfi(k){x^) is a subgradient of fi(^k) o,t the point x^ , and the relax- 
ation parameters {afcjfc^o ^'^^ confined to an interval ei < a^ < 2 — €2, for 
all k>0, with some, arbitrarily small, €1,62 > 0. 

Control: Denoting I := {1,2, ...,m}, the sequence {i{k)}'^^Q is an al- 
most cyclic control sequence on I. This means (see, e.g., Wl\ Definition 
5.1.1]) that i{k) G / for all k > and there exists an integer C > m such 
that, for allk>0, I C {i{k + l),i{k + 2), . . . ,i{k + C)}. 

Observe that if t'^ = 0, then /j(fc) takes its minimal value at x'^, implying, 
by the nonemptiness of Q, that fi(k){.x^) < 0, so that x'^^^ = x^. Relations 
of the CSP method to other iterative methods for solving the convex feasi- 
bility problem and to the relaxation method for solving linear inequalities 
can be found, e.g., in ^1], Chapter 5], see also, Bauschke and Borwein [H 
Section 7]. Since sequential projection methods for CFPs commonly have 
fully-simultaneous counterparts, the simultaneous subgradient projections 
(SSP) method of Dos Santos [32] and lusem and Moledo [H] is a natural 
algorithmic development. 

Algorithm 2.2 The method of simultaneous subgradient projections (SSP). 

Initialization: x^ G R"' is arbitrary. 

Iterative step: (i) Given x'', calculate, for all i E I = {1,2, ...,m}, 
intermediate iterates y^+^'^ by 



/n^ ) fk ■£ f.(rf.k\ 

II ,k \\2^ ' ^^ JA-'^ ) 

x\ if fi{x^) < 0, 



yk+i,i^) ^^-Qfc ip^fc ||2 ^^ if /i(a;''0 > 0, .2.2) 



where t^ G dfi{x^) is a suhgradient of fi at the point x^ , and the relaxation 
parameters {a^j^Q are confined to an interval ei < a^ < 1 — t2, for all 
k>0, with some, arbitrarily small, ei,e2 > 0. 
(ii) Calculate the next iterate x'^'^^ by 



x'^' 



J2^^y'^''' (2-3) 



where Wi are fixed, user-chosen, positive weights with YllLi '^i = 1- 

The convergence analysis for this algorithm is currently available only for 
consistent [Q ^ 0) CFPs, see [321 Hi] . In our experimental work, reported 
in the sequel, we applied Algorithm 12.21 to CFPs without knowing whether 
or not they are consistent. Convergence is diagnosed by performing plots of 
a proximity function that measures in some manner the infeasibility of the 
system. We used the weighted proximity function of the form 

m 

p{x) := (1/2) Y, Wi II Pi{x) - X ||2 (2.4) 

were Pi{x) is the orthogonal projection of the point x onto Qi. To combat 
instabilities in those plots that appeared occasionally in our experiments 
we used steering parameters a^ instead of the relaxation parameters a^ in 
Algorithm 12.21 To this end we need the following definition. 

Definition 2.3 A sequence {o"fc}fcLo ^/ ^^^^ numbers < cr^ < 1 zs called a 
steering sequence if it satisfies the following conditions: 

(2.5) 
(2.6) 

(2.7) 

A historical and technical discussion of these conditions can be found 
in [3]. The sequential and simultaneous Halpern-Lions-Wittmann-Bauschke 





lim o-fc = 0, 




oo 




'Y(rk = +00, 




fc=0 


oo 




Y. 


(Jfc - ak+m < +00 


k=0 





(HLWB) algorithms discussed in Censor [TB] employ the parameters of a 
steering sequence to "force" (steer) the iterates towards the solution of the 
best approximation problem (BAP). This steering feature of the steering 
parameters has a profound effect on the behavior of any sequence of iterates 
{x^}^o- ^^ return to this point in Section [6l 

Algorithm 2.4 The method of simultaneous subgradient projections (SSP) 
with steering. 

Initialization: x° G i?" is arbitrary. 

Iterative step: (i) Given x^, calculate, for all i E I = {1,2, . . . ,m\, 
intermediate iterates y^+^'^ by 

k+l,i ^ } ^ ~ '^ku * fc ||2^ 5 if fi\^ ) > 0, /2 g\ 

\ a;^ if /,(x^) < 0, 

where t^ G dfi{x^) is a subgradient of fi at the point x^ , and {(Jk] 
sequence of steering parameters. 

(ii) Calculate the next iterate x^^^ by 



k=0 



X^+1 



Y.'^^y'^'" (2-9) 



i=l 



where Wi are fixed, user-chosen, positive weights with Xli^i "^i = 1- 

3 Subgradient projections with strategical re- 
laxation: Preliminaries 

Considering the CFP (II. 2p . the envelope of the family of functions {/i}^i is 
the function 

f{x) := max{/i(x) \ i = 1,2, . . . ,m} (3.1) 

which is also convex. Clearly, the consistent CFP is equivalent to finding a 
point in 

Q = n^iQ, = {x G i?'^ I fix) < 0}. (3.2) 

The subgradient projections algorithmic scheme that we propose here em- 
ploys a strategy for controlling the relaxation parameters in a specific manner, 
leaving enough user-flexibility while giving some mathematical guarantees for 
the algorithm's behavior in the inconsistent case. It is described as follows: 



Algorithm 3.1 

Initialization: Let M be a positive real number and let x^ G -R" be any 
initial point. 

Iterative step: Given the current iterate x^, set 

/(x^) := {z I 1 < i < m and fi{x^) = f{x'')} (3.3) 

and choose a nonnegative vector w'' = {w^, W2, . . . , w^) G R"^ such that 

m 

J2w-=l andw'l = if iil{x^). (3.4) 

4 = 1 

Let Afc be any nonnegative real number such that 

max (0, /(x'^)) < AfcM^ < 2 max (O, f{x^)) (3.5) 

and calculate 

where, for each i G /(x^), we take a subgradient ^f G dfi{x^). 

It is interesting to note that any sequence {x'^}^_ generated by this al- 
gorithm is well-defined, no matter how x^ and M are chosen. Similarly to 
other algorithms described above, Algorithm 13. II requires computing subgra- 
dients of convex functions. In case a function is different iable, this reduces 
to gradient calculations. Otherwise, one can use the subgradient computing 
procedure presented in Butnariu and Resmerita [12j . 

The procedure described above was previously studied in Butnariu and 
Mehrez [TTj. The main result there shows that the procedure converges to 
a solution of the CFP under two conditions: (i) that the solution set Q has 
nonempty interior and (ii) that the envelope / is uniformly Lipschitz on i?", 
that is, there exists a positive real number L such that 

\f{x)-f{y)\<L\\x-y\\, for all x, y G i?^ (3.7) 

Both conditions (i) and (ii) are restrictive and it is difficult to verify their 
validity in practical applications. In the following we show that this method 
converges to solutions of consistent CFPs under less demanding conditions. 
In fact, we show that if the solution set Q of the given CFP has nonempty 

9 



interior, then convergence of Algorithm 13.11 to a point in Q is ensured even 
if the function / is not uniformly Lipschitz on i?" (i.e., even if / does not 
satisfy condition (ii) above). However, verifying whether int Q 7^ prior to 
solving a CFP may be difficult or even impossible. Therefore, it is desirable 
to have alternative conditions, which may be easier to verify in practice, that 
can ensure convergence of our algorithm to solutions of the CFP, provided 
that such solutions exist. This is why we prove convergence of Algorithm 13. II 
to solutions of consistent CFFs whenever the envelope / of the functions fi 
involved in the given CFP is strictly convex. Strict convexity of the envelope 
function / associated with a consistent CFP implies that either the solution 
set Q of the CFP is a singleton, in which case int Q = 0, or that Q contains 
(at least) two different solutions of the CFP implying that int Q ^ 0. Veri- 
fication of whether Q is a singleton or not is as difficult as deciding whether 
int Q 7^ 0. By contrast, since / is strictly convex whenever each fi is strictly 
convex, verification of strict convexity of / may be relatively easily done in 
some situations of practical interest, such as when each fi is a quadratic con- 
vex function. In the latter case, strict convexity of fi amounts to positive 
definiteness of the matrix of its purely quadratic part. 

It is interesting to note in this context that when the envelope / of the 
CFP is not strictly convex one may consider a "regularized" CFP in which 
each fi which is not strictly convex is replaced by 

Ji{x) := fi{x) + aWxW" (3.8) 

for some positive real number a. Clearly, all fi are strictly convex and, thus, 
so is the envelope / of the regularized problem. Therefore, if the regularized 
problem has solutions, then our Algorithm 13.11 will produce approximations 
of such solutions. Moreover, any solution of the regularized problem is a so- 
lutions of the original problem and, thus, by solving the regularized problem 
we implicitly solve the original problem. The difficult part of this approach is 
that, even if the original CFP is consistent then the regularized version of it 
may be inconsistent for all, or for some, values a > 0. How to decide whether 
an a > exists such that the corresponding regularized CFP is consistent 
and how to compute such an a (if any) are questions whose answers we do 
not know. 
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4 Subgradient projections with strategical re- 
laxation: convergence analysis 

In order to discuss the convergence behavior of the subgradient projections 
method with strategical relaxation, recall that convex functions defined on 
the whole space i?" are continuous and, consequently, are bounded on bounded 
sets in i?". Therefore, the application of Butnariu and lusem fiUi Proposition 
1.1.11] or Bauschke and Borwein [H Proposition 7.8] to the convex function 
/ shows that it is Lipschitz on bounded subsets of K^, i.e., for any nonempty 
bounded subset S C K^ there exists a positive real number L{S), called a 
Lipschitz constant of f over the set S, such that 

\f{x)-f{y)\<L{S)\\x-y\\,ioTa\\ x,y e S. (4.1) 

Our next result is a convergence theorem for Algorithm 13.11 when applied to 
a consistent CFP. It was noted in the previous section that Algorithm 13.11 is 
well-defined regardless of how the initial point x^ or the positive constant M 
involved in the algorithm are chosen. However, this is no guarantee that a 
sequences {xfcj^Q generated by Algorithm 13. II for random choices of x° and 
M will converge to solutions of the CFP, even if such solutions exist. The 
theorem below shows a way of choosing x° and M which ensures that, under 
some additional conditions for the problem data, the sequence {x^J^q gen- 
erated by Algorithm 13.11 will necessarily approximate a solution of the CFP 
(provided that solutions exist). As shown in Section below, determining a;° 
and M as required in the next theorem can be quite easily done for practi- 
cally significant classes of CFPs. Also, as shown in Section [6l determining x^ 
and M in this manner, enhances the self-adaptability of the procedure to the 
problem data and makes Algorithm 13.11 produce approximations of solutions 
of the CFP which, in many cases, are more accurate than those produced by 
other CFP solving algorithms. 

Theorem 4.1 If a positive number M and an initial point x^ in Algorithm 
\3.1\ are chosen so that M > L{B{x^,r)) for some positive real number r 
satisfying the condition 

5(x°,r/2)ng^0, (4.2) 

and if at least one of the following conditions holds: 
(i) E(a;°,r/2)nint Q ^ 0, 
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(a) the function f is strictly convex, 
then any sequence {x''^ , generated by Algorithm I5'.i|. converges to an 
element of Q. 

We present the proof of Theorem 14.11 as a sequence of lemmas. To do 
so, note that if {x'^} is generated by Algorithm 13. II then for each integer 
A; > 0, we have 



where 

^':= Y. ^fef econvU,g,(,,.)9/,(x^). (4.4) 

Using (14. 3p . for any z E K", we have 

„fc+l ^||2 _ II^A: ^||2 -, /-, ||,,fc||2 o /,,*: ^k 



||a;'=+i _ z\\ = Wx" - z\\ + Afc i\k \y\\ - 2 (^^^ x'^ - z)) . (4.5) 

By Clarke [211 Proposition 2.3.12] we deduce that 

a/(x'=) = convUie/(.^)9/,(x^) (4.6) 

and this implies that u^ G df{x^) because of (14.41) . Therefore, 

{u',z-x')<f'4x';z-x') (4.7) 

where f^{u;v) denotes the right-sided directional derivative at u in the di- 
rection V. Now suppose that M, r and x° are chosen according to the require- 
ments of Theorem 14. 1|, that is, 

r > 0, M> L(5(x°,r)) and5(x°,r/2)nQ ^0. (4.8) 

Next we prove the following basic fact. 

Lemma 4.2 // ^4-^ ^^ satisfied and if z G B{x^,r/2)nQ, then for all k > 0, 
we have for any sequence {x^\ , generated by Algorithm \3.1\ 

a;^'+^ e E(a;°, r) and ||x^+^ - ^11 < Ik^ - -zll < r/2. (4.9) 
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Proof. We first show that if, for some integer k > 0, 

x^ e B{x^, r) and ||x'' - ;z|| < r/2 (4.10) 

then (14.91) holds. If A^ = or u'' = 0, then, by (14.31) . we have x'^^^ = x'' 
which, combined with (14.101) . implies (14. 9p . Assume now that A^ 7^ and 
z/'^ ^ 0. Since, by ( iTTOj) . x'' G 5(x°,r), by dM]) and by [211 Proposition 
2.1.2(a)], we deduce that 

M> L(5(x°,r)) > ||z/^||. (4.11) 

According to (13. 5|) . we also have f{x^) > (otherwise A^ = 0). Since f{z) < 
we obtain from the subgradient inequality 

(^/^ x''-z)> fix'') - f{z) > fix'') > 0. (4.12) 

This and (14.111) imply 

2(^/^a;^-2) > 2/(s'=) > A^M^ > A^ ||z/'=||\ (4.13) 



showing that the quantity inside the parentheses in (14.51) is nonpositive. 
Thus, we deduce that 

||x'=+^ - z|| < llx'' - z|| < r/2 (4.14) 

in this case too. This proves that if (I4.10p is true for all A; > 0, then so is 
(14.91) . Now, we prove by induction that (I4.10p is true for all fc > 0. If A; = 
then (I4.10p obviously holds. Suppose that (14.100 is satisfied for some k = p. 
As shown above, this implies that condition (14. 9 p is satisfied for k = p and, 
thus, we have that 

xP+^ G 5(x°,r) and ||xP+^ - z\\ < r/2. (4.15) 

Hence, condition (I4.10p also holds for k = p + 1. Consequently, condition 
(14. 9p holds for k = p + 1 and this completes the proof. ■ 

Observe that, according to Lemma 14.21 if {a;'^},_ is a sequence gener- 
ated by Algorithm 13.11 and if the conditions (14. 8 p are satisfied, then there 
exists z G B{x'^, r/2) fl Q and for any such z the sequence { ||a;'^ — 2;|| }^g is 
nonincreasing and bounded from below and, therefore, convergent. Since the 
sequence {||a;'^ — ^||}^q is convergent it is also bounded and, consequently, 
the sequence {a:'^}^^^ is bounded too. This shows that the next result applies 
to any sequence {x''^ generated by Algorithm 13. II under the assumptions 
of Theorem 14.11 
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Lemma 4.3 // {x'^j is a hounded sequence generated by Algorithm \3.1\ 



k=0 

oo 



then the sequence {x'^^ has accumulation points and for each accumulation 
point X* of {x^} f^^^ there exists a sequence of natural numbers {fc^j^o ^'^'^h 
that the following limits exist 



X* == lim x'^", A^, 


= lim \k,, 

S—KX) 




(4.16) 


c = lim ef% w* 


= lim ti?f% for all i = 1, 2, . . 

S — ^(X) 


.,m, 


(4.17) 


V* = lim u''% 

s— >oo 






(4.18) 


and we have 








w* := iwl,w;,. 


..,w*JeR^ and y^ w*- 

iei{x*) 


= 1 


(4.19) 



and 

Moreover, if X^, = 0, then x* is a solution of the CFP. 

Proof. The sequence {x''}^^^ is bounded and, thus, has accumulation 
points. Let x* be an accumulation point of [x^^^^^ and let {x^^J^q be a con- 
vergent subsequence of {x^^ such that x* = lim^^ooa^^^- The function / 
is continuous (since it is real- valued and convex on K^), hence, it is bounded 
on bounded subsets of K^. Therefore, the sequence {/(a;^0}^o converges 
to f{x*) and the sequence {f{x'')}f^^Q is bounded. By (13.51) . boundedness 
of {f{x'')}j^^Q implies that the sequence {Afcj^g is bounded. Since, for ev- 
ery i = 1,2, ... ,m, the operator dfi : R^ — >■ 2^" is monotone, it is locally 
bounded (cf. Pascali and Sburlan [HI Theorem on p. 104]). 

Consequently, there exists a neighborhood U of x* on which all dfi, 
i = 1,2, ... ,m, are bounded. Clearly, since x* = UtHs-^oo x^% the neighbor- 
hood U contains all but finitely many terms of the sequence {xP"}'^q . This 
implies that the sequences {^f'l^o ^^^ uniformly bounded and, therefore, 
the sequence {^'^''j^o ^^ bounded too. 

Therefore, there exist a subsequence {ks}'^Q of {ps}^o such that the 
hmits in (I4.16p - (l4.18p exist. Obviously, the vector w* = {wl, w^, . ■ . , w^) e 
K^, and, according to |lli Lemma 1], we also have J2i(^i(x*)'^i ~ ^- This 
and dOD imply that u* = E.e/(.*)<C- 
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Observe that, since z/*^" G df{x^'') for all s > 0, and since df is a closed 
mapping (cf. Phelps [IHl Proposition 2.5]), we have that v* G df{x*). Now, 
if A* = then, according to (13.51) . and the continuity of /, we deduce 

< max{0,/(2;*)} = lim max{OJ{x''')} < lim A^.M^ = KM^ = 0, 

s— »oo s— >oo 

(4.21) 
which implies that f{x*) < 0, that is, x* E Q. ■ 

Lemma 4.4 Let {x*"'}. be a sequence generated by Alaorithm \3.1[ If Hj-S^ 
is satisfied and if at least one of the conditions (i) or (ii) of Theorem \4-l\ 
holds, then the sequence {x'^}, _ has accumulation points and any such point 
belongs to Q. 

Proof. As noted above, when (14. 8 p is satisfied then the sequence {x'^},_ 
is bounded and, hence, it has accumulation points. Let x* be such an ac- 
cumulation point and let {^sj^o ^^ ^^^ sequence of natural numbers asso- 
ciated with X* whose existence is guaranteed by Lemma 14.31 Since, for any 
z G C n B{x^,r/2), the sequence |||a:;'^ — 2;||},_„ is convergent (cf. Lemma 
14.21) we deduce that 

||x* - z\\ = lim llx''' - 2:11 = lim llx'' - z\\ = lim ||x''''+-^ - z\\ (4.22) 

s^oo fc— >oo s— >oo 

= ||x* - A*Z/* - z\\ . 
This implies 

II * l|2 II * ||2 . \ /\ II *||2 o / * * \\ //I oo\ 

\\x — z\\ = \\x — z\\ +A*(A*||z/|| — 2(z/,a; — z)) . (4.23) 

If A* = 0, then x* E Q hj Lemma H751 Suppose that A^, > 0. Then, by (I4.23p . 
we have 

X,\\u*f -2{u*,x* -z) = 0, (4.24) 

for all 2; G C n B{x^,r/2). We distinguish now between two possible cases. 

Case I: Assume that condition (i) of Theorem 14. II is satisfied. According 
to (14.240 ■ the set Q fl B{x^, r/2) is contained in the hyperplane 

H:= {xeK'l (z/*,x) = (l/2)(2(z/*,x*)-A*||z/*f)}. (4.25) 

By condition (i) of Theorem \4-l\ it follows that int {Q {~\ B{x^ ,r/2)) 7^ 



and this is an open set contained in int H. So, unless z/* = (in which case 
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H = -R"), we have reached a contradiction because intH = 0. Therefore, we 
must have u* = 0. According to Lemma [4. 3[ we have = z/* G df{x*) which 
imphes that x* is a global minimizer of /. Consequently, for any z & Q we 
have f{x*) < f{z) < 0, that is, x* e Q. 

Case II: Assume that condition [ii) of Theorem 14. II is satisfied. Accord- 
ing to fl4.24p . we have 

\^\yf = 2{u*,x*-z). (4.26) 

By (13. 5p . the definition of M and |24| Proposition 2.1.2] we deduce that 

2f{x'')>\k.M'>Xk^y'\\\ (4.27) 

for all integers s > 0. Letting s —>■ oo we get 

2f{x*) > A,M2 > A, \\u*f = 2 {u*,x* - z) , (4.28) 

where the last equality follows from (14.261) . Consequently, we have 

f{x*)>{u*,x*-z), for all z e Q f] B{x'^,r/2). (4.29) 

Convexity of / implies that, for all z & Q (1 B{x^, r/2), 

- fix*) <{iy*,z- X*) < f{z) - fix*) < -fix*). (4.30) 

Therefore, we have that 

- f{x*) = {u*,z~ X*) = fiz) - fix*), for all zeQf] 5(a;°, r/2). (4.31) 

Thus fiz) = 0, for all 2; G Q fl i?(x°,r/2). Hence, using again the convexity 
of /, we deduce that, for all z E Q H -B(x°, r/2), 

f'^ix*; z-x*)< fiz) - fix*) = -fix*) = {u*, z-x*)< f'^x*; z-x*). 

(4.32) 
This implies 

/;(x*; z~x*) = {u*, z-x*) = fiz) - fix*), for all z G Q n 5(a;°, r/2). 

(4.33) 
Since, by condition (ii) of Theorem \4-l\ f is strictly convex, we also have 
(see [To], Proposition 1.1.4]) that 

f'^ix*; z-x*)< fiz) - fix*), for all z e {Q n 5(x°, r/2)) \{a;*}. (4.34) 
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Hence, the equalities in (14.331) cannot hold unless Qf\B{x^ ,r /2) = {x*} and, 
thus, X* e Q. m 

The previous lemmas show that if (14. 8p holds and if one of the conditions 
(i) or (ii) of Theorem 14. II is satisfied, then the sequence {x^}^ is bounded 
and all its accumulation points are in Q. In fact, the results above say some- 
thing more. Namely, in view of Lemma 14.21 they show that if (14.81) holds 
and if one of the conditions (i) or (ii) of Theorem 14.11 is satisfied, then all 
accumulation points x* of {a^'^j^^Q are contained in Q Ci B{x^,r) because all 
x^ are in B{x^,r) by (14.91) . In order to complete the proof of Theorem 14. ![ 
it remains to show that the following result is true. 



Lemma 4.5 Under the conditions of Theorem \4.1\ any sequence {x''^, , 



generated by Alaorithm \3.1\ has at most one accumulation point. 

Proof. Observe that, under the conditions of Theorem 14.11 the conditions 
]8]) are satisfied and, therefore, the sequence {a;'^}^^^ is bounded. Let x* 



be an accumulation point of {x'^J^^q . By Lemma 1^^ we deduce that x* G Q, 
i.e., /(x*) < 0. Consequently, for any natural number k we have 

{v\x^-x*)>f{x^)-f{x*)>f{x^). 

Now, using this fact, a reasoning similar to that which proves ( I4.13P but 
made with x* instead of z leads to 

2{v\x^ -X*) >\k\yf , 



for all natural numbers k. This and (14. 5 p combined imply that the sequence 
|||x^ — x*||} is non-increasing and, therefore, convergent. Consequently, 
if {x'^''} _ is a subsequence of {x'^}^_ such that limp^oo x'^'' = x*, we have 



lim X -X* = lim \\x^ - x*\\ = 0, 
showing that any accumulation point x* of {x'^} is exactly the limit of 

K}r=o- ■ 

The application of Theorem 14.11 dep ends on our ability to choose numbers 
M and r and a vector x° such that condition (14.81) is satisfied. We show below 
that this can be done when the functions fi of the CFP (II. 2p are quadratic 
or afiine and there is some a priori known ball which intersects Q. In actual 
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applications it may be difficult to a priori decide whether the CFP 01. 2p has 
or does not have solutions. However, as noted above, Algorithm 13. II is well- 
defined and will generate sequences {x'^} no matter how the initial data 
M, r and x° are chosen. This leads to the question whether it is possible to 
decide if Q is empty or not by simply analyzing the behavior of sequences 
jx'^} generated by Algorithm 13. 1[ A partial answer to this question is 
contained in the following result. 

Corollary 4.6 Suppose that the CFP / IJ.^j) has no solution and that the 
envelope f is strictly convex. Then, no matter how the initial vector a;° 
and the positive number M are chosen, any sequence {x'^} , generated by 
Algorithm \3.1\ has the following properties: 
(i) If {a;'^},_ is bounded and 

lim llx^+^-x'^ll =0, (4.35) 

then f has a (necessarily unique) minimizer and {x'^'}, converges to that 
minimizer while 

lim f{x^) = inf{/(x) | x E R""}. (4.36) 

fc^oo 

(a) If f has no minimizer then the sequence {x^},_ is unbounded or the 
sequence |||x'^"'"^ "~3;'^||},_ does not converge to zero. 

Proof. Clearly, (ii) is a consequence of (z). In order to prove (i) observe 
that, since the CFP (II. 2p has no solution, all values of / are positive. Also, if 
/ has a minimizer, then this minimizer is unique because / is strictly convex. 
If {x^^ is bounded then it has an accumulation point, say, x*. By 
Lemma 14.31 there exists a sequence of positive integers {fcsj^Q such that 
(I4.16P and (14.190 - 04. 201) are satisfied. Using Lemma 14.31 again, we deduce 
that, if the limit A,, in (14.160 is zero, then the vector x* = linis^oo x^" is a 
solution of the CFP (II. 2p . i.e., f{x*) < 0, contradicting the assumption that 
the CFP ([OD has no solution. Hence, A, > 0. By (gSD, (H^ and KTM we 
have that 

= lim Xk^' = Ku*. (4.37) 

s— >oo 

Thus, we deduce that v* = 0. From Oi:T9D - (l4:20D and [24, Proposition 2.3.12] 
we obtain 

= u*= J2 <C* e df{x*), (4.38) 

i£l{x*) 
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showing that x* is a miniinizer of /. So, all accumulation points of jx'^} 
coincide because / has no more than one minimizer. Consequently, the 
bounded sequence {a;'^},_ converges and its limit is the unique minimizer 
of/. ■ 

Remark 4.7 Checking numerically a condition such as (ii) in Corollary \4 -61 
or the condition in Corollary \4.9\ below seems virtually impossible. But there 
is no escape from such situations in such mathematically- oriented results. 
Condition (ii) in Corollary \4.6\ is meaningful in the inconsistent case in which 
a feasible point does not exist but a proximity function that "measures" the 
feasibility violation of the limit point can be minimized. An easy adaptation 
of the proof of Corollary 4-6 shows that if the sequence {x'^ } , _ has a bounded 
subsequence {x'^*} such that the limit \iiat^oo{x'^^~^^ — x*^*) = 0, then all 
accumulation points of {x'^*} are minimizers of f (even if f happens to 
be not strictly convex). 

Remark 4.8 The fact that for some choice ofx^ and M a sequence {x'^} , 
generated by Algorithm \3.1[ has the property that \m\k^oo f {x^) = 0, does 
not imply that the CFP U.2\) has a solution. For example, take in / li.^j) 
m = n = 1 and /i(x) = e~^. Clearly, in this case U.2^) has no solution 
and f = /i. However, for x° = 0, M = 1 and Xk = (3/2)/(x'^), we have 
limk^oofix'') = 0. 

A meaningful implication of Corollary 14.61 is the following result. 

Corollary 4.9 Suppose that the CFP (2) has no solution and that f is 
strictly convex. Then, no matter how the initial vector x° and the positive 
number M are chosen in Algorithm \3.1[ the following holds: If the series 
YlT=o W^'' ~ x'^^^W converges, then the function f has a unique global mini- 
mizer and the sequence {x'^}, , generated by Algorithm \3.1[ converges to 
that minimizer while the sequence {/(x^)}^^^ converges to inf{/(x) | x G 

Proof. When X^fe^o IN'^ ~ x'^'^-^ll converges to some number S we have 

k 

||x° - x^'+MI < y llx^ - x^+MI < S, (4.39) 

11 11 / ^ 11 11 ' ^ ^ 

i=0 
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for all integers k > 0. This implies that the sequence {x^}^_ is bounded and 
linifc_»oo lla;'^ — x'^+^ll = 0. Hence, by applying Corollary 4.6, we complete the 
proof. ■ 

Remark 4.10 Finding an initial vector x^, the radius r and a positive num- 
ber M satisfying condition M > L{B{x^,r)) (and satisfying Ij^^ provided 
that Q is nonempty) when there is no a priori knowledge about the exis- 
tence of a solution of the CFP can be quite easily done when at least one 
of the sets Qi, say Qig, is bounded and the functions fi are difjerentiable. 
In this case it is sufficient to determine a vector x° and a positive number 
r large enough so that the ball B{x^,r/2) contains Qi^. Clearly, for such a 
ball, if Q is nonempty, then condition ( [^.ijp holds. Once the ball B{x'^,r) is 
determined, finding a number M > L{B{x^,r)) can be done by taking into 
account that the gradients of the differentiable convex functions fi : i?" — »• R 
are necessarily continuous and, therefore, the numbers 

U = sup{||V/,(x)|| I X G 5(x°,r)} (4.40) 

are necessarily finite. Since L := maxjLj \ 1 < i < m} is necessarily a 
Lipschitz constant of f over B{x^,r), one can take M = L. 

Remark 4.11 The method of choosing x°, r and M presented in Remark 
4.10\ does not require a priori knowledge of the existence of a solution of 



the CFP and can be applied even when Q is empty. In such a case one 
should compute, along the iterative procedure of Algorithm \3.1[ the sums 
Sk = ^^=0 11^^ ^ ^^^""^11 • Theorem 4-1 o,nd Corollary 4-9 then provide the 



following insights and tools for solving the CFP, provided that f is strictly 
convex: 

• If along the computational process the sequence Sk remains bounded 
from above by some number S* while the sequence {/(a;'^)}^o stabilizes 
itself asymptotically at some positive value, then the given CFP has no 
solution, but the sequence {x^}, still approximates a global minimum 
of f which may be taken as a surrogate solution of the given CFP. 

• If along the computational process the sequence Sk remains bounded 
from above by some number S* while the sequence {f {x^)}'kLo stabilizes 
itself asymptotically at some nonpositive value, then the given CFP 
has a solution, and the sequence {x'^}, approximates such a solution. 
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5 Implementation of Algorithm 13.11 for linear 
or quadratic functions 

Application of Algorithm 13.11 does not require knowledge of the constant r. 
However, in order to implement this algorithm, such that the conditions for 
convergence will be guaranteed, we have to determine numbers r and M 
required by Theorem 14. 1[ The method proposed in Remark 14.101 might yield 
a very large value of r. This is due to the mathematical generality of Remark 
14.101 The quadratic and afiine cases treated next seem to be restrictive from 
the theoretical/mathematical point of view, but their importance lies in the 
fact that they cover many significant real-world applications. 

We deal first with the problem of determining a number M such that 

M>L{B{x\r)), (5.1) 

provided that an r > is given. Recall that if g : i?" —>■ R is a continuously 
differentiable function then, by Taylor's formula, we have that, whenever 
x,y E i?(x°, r), there exists a -u G [x, y] such that 

I 9{y)-9{x) 1 = 1 {Vg{u),y-x) \< ||V^(m)|| \\y - x\\ 

< \\y - x\\ max{||V5f(M)|| | u E B{x^,r)}. (5.2) 

This shows that 

max{||V^(n)|| | u G 5(s°,r)} (5.3) 

is a Lipschitz constant for g on B{x^,r). Suppose now that each function /j 
is either linear or quadratic. Denote h = {i \ 1 < i < m, fi is linear} and 
/2 = {i I 1 < i < m, fi is quadratic}. Namely, 

fi{x) = {a\ x) + hi, for all i G /i, (5.4) 

with a' G i?"\ {0} and h G i?, and 

fi{x) = {x, Uix) + (a*, x) + hi, for all i G /2, (5.5) 

where Ui = {u\ ^) is a symmetric positive semidefinite n x n matrix, a* G R^ 
and hi E R. We have, of course, 
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so that fl5.3p can give us Lipschitz constants for each /j over B{x^, r). Denote 

T ._ / ll«1' if^G/i, . . 

I 2||f/.|L(||xO||+r) + ||a^||, iUeh, ^^'^^ 

where imjUo^ is the operator norm of f/j. Due to (14. 6p . this imphes that 
UxeB(xO,r)5/(a;) C 5(0, L) where 

L := niax{Lj \ 1 < i < m}. (5-8) 

Taking ^ G df{x) and C G df{y), for some x,y E B{x^,r), we have 

i^||a;-y|| > IICII llx-yll > {C,y-x) > f{y)- f{x) 

>{^,y-x)>- U\\ \\x - y\\ > -L \\x - y\\ , (5.9) 

which imphes 

\f{y)- f{x)\<L\\x-y\\, ioT a\\ x,y e B{x^,r). (5.10) 

In other words, L is a Lipschitz constant of / over B{x^,r). Thus, given an 
r > 0, we can take M to be any number such that M > L. Note that choosing 
x^ such that the corresponding r is small may speed up the computational 
process by reducing the number of iterations needed to reach a reasonably 
good approximate solution of the CFP. In general, determining a number r is 
straightforward when one has some information about the range of variation 
of the coordinates of some solutions to the CFP. 

For instance, if one knows a priori that the solutions of the CFP are 
vectors x = {xj)'^^^ such that 

£j < Xj < Uj, I < j < n, (5-11) 

where, ij,Uj G R, for all j, then the set Q is contained in the hypercube 
of edge length 6 = Wmax — ^min, whose faces are parallel to the axes of the 
coordinates, and centered at the point x° whose coordinates are x^ = |(^mm+ 
Mmax), where 



mmH 



1 < j < ""■} and Mmax := niaxJMj | 1 < j < n}. (5.12) 



Therefore, by choosing this x^ as the initial point for Algorithm 13.11 and 
choosing r = ^/n6, condition (14. 2 p holds. 
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6 Computational results 

In this section, we compare the performance of Algorithms 12.21 12.41 and 13.11 
by examining a few test problems. There are a number of degrees-of-freedom 
used to evaluate and compare the performance of the algorithms. These are 
the maximum number of iterations, the number of constraints, the lower and 
upper bounds of the box constraints, the values of the relaxation parameters, 
the initial values of the steering parameters and the steering sequence. In all 
our experiments, the steering sequence of Algorithm 12.41 assumed the form 

... = ^ (6.1) 

with a fixed user-chosen constant a. The main performance measure is the 
value of f{x^), plotted as a function of the iteration index k. 

6.1 Test problem description 

There are three types of constraints in our test problems: Box constraints, lin- 
ear constraints and quadratic constraints. Some of the numerical values used 
to generate the constraints are uniformly distributed random numbers, lying 
in the interval r = [Ti,r2], where Ti and T2 are user-chosen pre-determined 
values. 

The n box constraints are defined by 

^j < Xj < Uj, j = 1,2, . . . ,n (6.2) 

where ij,Uj G r are the lower and upper bounds, respectively. Each of the 
Nq quadratic constrains is generated according to 

Gi{x) = {x,Uix) + {v\x) + (3i, t = l,2,...,Ng. (6.3) 

Here f/j is are n x n matrices defined by 

[/, = W^AW^, (6.4) 

the n X n matrices Aj are diagonal, positive definite, given by 

A, = diag {61, 6i,..., 61) (6.5) 

where < S{ < (^2 < . . . < 5^ G r are generated randomly. The matrices Wi 
are generated by orthonormalizing an n x n random matrix, whose entries 
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lie in the interval r. Finally, the vector f* G i?" is constructed so that all 
its components lie in the interval r and similarly the scalar /3j G r. The A^^ 
linear constraints are constructed in a similar manner according to 



Li{x) = {y\ x) + -fu ^ = 1,2, 



Np. 



(6.6) 



Thus, the total number of constraints is n + Nq + Np. 

Table 16.11 summarizes the test cases used to evaluate and compare the 
performance of Algorithms 12.21 12.41 and 13. 1[ In these eight experiments, we 
modified the value of the constant a in fl6.ll) . the interval r, the number of 
constraints, the number of iterations, and the relative tolerance e, used as a 
termination criterion between subsequent iterations. 



Case 


a/a/\ 


r 


n 


N, 


Np 


Iterations 


e 


1 


1.1 




3 


5 


5 


1,000 




2 


1.1 




3 


5 


5 


1,000 




3 


1.98 




3 


5 


5 


1,000 




4 


1.98 


-0.1,0.1] 


30 


50 


50 


1,000 


0.1 


5 


1.98 


[-10,10] 


30 


50 


50 


100,000 


0.1 


6 


2 


[-0.1,0.1] 


30 


50 


50 


1,000 


0.1 


7 


3 


[-10,10] 


3 


5 


5 


1,000 


0.1 


8 


5 


[-0.1,0.1] 


3 


5 


5 


1,000 


0.1 



Table 6.1: Test cases for performance evaluation 



In Table |6TT| Cases 1 and 2 represent small-scale problems, with a total of 
13 constraints, whereas Cases 4-6 represent mid-scale problems, with a total 
of 130 constraints. Cases 6-8 examine the case of over relaxation, wherein 
the initial steering (relaxation) parameter is at least 2. 



6.2 Results 

The results of our experiments are depicted in Figures [IHSl The results of 
Cases 1-3 are shown in Figures [T](a)-[T](c), respectively. It is seen that in Case 
1 Algorithm 12.21 has better initial convergence than Algorithms 12.41 and 13.11 
However, in Case 2, Algorithm 12.41 yields fast and smooth initial behavior, 
while Algorithm 12.21 oscillates chaotically. Algorithm 13 . II exhibits slow initial 
convergence, similarly to Case 1. In Case 3, Algorithm 13.11 supersedes the 
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performance of the other two algorithm, since it continues to converge toward 
zero. However, none of the algorithms detects a feasible solution, since none 
converged to the tolerance threshold after the maximum number of iterations. 



The mid-sized problems of Cases 4 and 5 are depicted by Figures lU^a) 
and[2](b). Figure [2]^a) shows that Algorithm 13.11 detects a feasible solution, 
while both Algorithms 12.21 and 12.41 fail to detect such a solution. The curve of 
Algorithm 13.11 in Fig. ^a) stops when it reaches the feasible point detection 
tolerance, which is 0.1. Once the point is detected, there is no need to 
further iterate, and the process stops. The curve for Algorithm 12.21 in this 
figure shows irregular behavior since it searches for a feasible solution without 
reaching the detection threshold of 0.1 and accumulated numerical errors 
start to affect it. Figure[2](b) shows a phenomenon similar to the one observed 
in the small-scale problem: Algorithm 13.11 continues to seek for a feasible 
solution, while Algorithms 12.21 and 12.41 converge to a steady-state, indicating 
failure to detect a feasible solution. 

In the experiments. Cases 6-8, Algorithm 13.11 outperforms the other algo- 
rithms, arriving very close to finding feasible solutions. It should be observed 
that the behavior of Algorithm 13.11 observed above is the result of the way 
in which the relaxation parameters A^ are self-regulating their sizes. In Al- 
gorithm [HII] the relaxation parameter A^ can be chosen (see Equation (3.6)) 
to be any number of the form 

max(0,/(x^)) max(0,/(x^)) max(0,/(x^)) 

^k = P^ m'^ + ^^1 - Pk) ^12 = ^^- W jf2 ' 

(6.7) 
where (3k runs over the interval [0, 1] . Consequently, the size of A^ can be very 
close to zero when x^ is close to a feasible solution (no matter how Pk is chosen 
in [0, 1]). Also, Afe may happen to be much larger then 2 when x^ is far from 
a feasible solution and the number /(x'^) is large enough (note that 2 — Pk 
stays between 1 and 2). So, Algorithm l3.1l is naturally under- or over- relaxing 
the computational process according to the relative position of the current 
iterate x'^ to the feasibility set of the problem. As our experiments show, in 
some circumstances, this makes Algorithm 13. II behave better then the other 
procedures we compare it with. At the same time, the self-regulation of the 
relaxation parameters, which is essential in Algorithm 13. H may happen to 
reduce the initial speed of convergence of this procedure, that is. Algorithm 
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Figure 2: Simulation results for a mid-scale problem, comparing Algorithms 
221 [231 and EH 



13. II may require more computational steps in order to reach a point x'^ which 
is close enough to the feasibility set such that its self-regulatory features 
to be really advantageous for providing a very precise solution of the given 
problem (which the other procedures may fail to do since they may became 
stationary in the vicinity of the feasibility set). Another interesting feature of 
Algorithm 13.11 which differentiates it from the other algorithms we compare 
it with, is its essentially non-simultaneous character: Algorithm 13.11 does 
not necessarily ask for w^ > for all i G {l,...,m}. The set of positive 
weights wf which condition the progress of the algorithm at step k essentially 
depends on the current iterate x'' (see (3.4)) and allows reducing the number 
of subgradients needed to be computed at each iterative step (in fact, one can 
content himself with only one wf > and, thus, with a single subgradient 
^f). This may be advantageous in cases when computing subgradients is 
difficult and, therefore, time consuming. 

The main observations can be summarized as follows: 

1. Algorithm 13.11 exhibits faster initial convergence than the other algo- 
rithms in the vicinity of points with very small f{x^). When the algo- 
rithms reach points with small f{x^) values, then Algorithm 13.11 tends 
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Figure 3: Simulation results for small- and mid-scale problems with overre- 
laxation, comparing Algorithms 12.21 12.41 and 13.11 
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to further reduce the value of f{x^), while the other algorithms tend 
to converge onto a constant steady-state value. 

2. The problem dimensions in our experiments have little impact on the 
behavior of the algorithms. 

3. All the examined small-scale problems have no feasible solutions. This 
can be seen from the fact that all three algorithms stabilize around 

/(x'=) = 50. 

4. The chaotic oscillations of Algorithm 12.21 in the underrelaxed case is 
due to the fact that this algorithm has no internal mechanism to self- 
adapt its progress to the distance between the current iterates and 
the sets whose intersections are to be found. This phenomenon can 
hardly happen in Algorithm 13.11 because its relaxation parameters are 
self-adapting to the size of the current difference between successive 
iterations. This is an important feature of this algorithm. However, 
this feature also renders it somewhat slower than the other algorithms. 

5. In some cases, Algorithms 12.21 and 12.41 indicate that the problem has 
no solution. In contrast. Algorithm 13.11 continues to make progress 
and seems to indicate that the problem has a feasible solution. This 
phenomenon is again due to the self-adaptation mechanism, and can 
be interpreted in one of the following ways: (a) The problem indeed 
has a solution but Algorithms 12.21 and 12.41 are unable to detect it (be- 
cause they stabilize too fast). Algorithm 13.11 detects a solution provided 
that it is given enough running time; (b) The problem has no solution 
and then Algorithm 13.11 will stabilize close to zero, indicating that the 
problem has no solution, but this may be due to computing (round-off) 
errors. Thus, a very small perturbation of the functions involved in the 
problem may render the problem feasible. 

7 Conclusions 

We have studied here mathematically and experimentally subgradient pro- 
jections methods for the convex feasibility problem. The behavior of the fully 
simultaneous subgradient projections method in the inconsistent case is not 
known. Therefore, we studied and tested two options. One is the use of steer- 
ing parameters instead of relaxation parameters and the other is a variable 
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relaxation strategy which is self-adapting. Our small-scale and mid-scale ex- 
periments are not decisive in all aspects and call for further research. But one 
general feature of the algorithm with the self-adapting strategical relaxation 
is its stability (non-oscillatory) behavior and its relentless improvement of 
the iterations towards a solution in all cases. At this time we have not yet 
refined enough our experimental setup. For example, by the iteration index 
k on the horizontal axes of our plots we consider a whole sweep through all 
sets of the convex feasibility problem, regardless of the algorithm. This is a 
good first approximation by which to compare the different algorithms. More 
accurate comparisons should use actual run times. Also, several numerical 
questions still remain unanswered in this report. These include the effect of 
various values of the constant a as well as algorithmic behavior for higher 
iteration indices. In light of the applications mentioned in Section [H higher 
dimensional problems must be included. These and other computational 
questions are currently investigated. 
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